Fix: Sebulba PPO Metrics #108

EdanToledo · 2024-08-27T20:45:30Z

What?

Fix issues with actor steps per second and the number of timesteps logged.

Why?

The metrics code was a holdover from how Anakin operates and produced incorrect values for Sebulba.

How?

Use num_envs_per_actor * rollout_length as the per-update step count, since this is what the learner consumes as a batch. Multiplying this by the number of updates per eval gives the number of timesteps consumed by the learner per eval step.
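As a rough illustration of the arithmetic above (a minimal sketch, not the code in this PR): the function names, the `elapsed_seconds` argument, and the example values are hypothetical; only `num_envs_per_actor`, `rollout_length`, and the number of updates per eval come from the description.

```python
def timesteps_per_eval(num_envs_per_actor: int, rollout_length: int, num_updates_per_eval: int) -> int:
    """Timesteps consumed by the learner in one eval step (hypothetical helper)."""
    # One learner batch is one rollout from one actor's set of environments.
    steps_per_update = num_envs_per_actor * rollout_length
    return steps_per_update * num_updates_per_eval


def steps_per_second(timesteps: int, elapsed_seconds: float) -> float:
    """Throughput over a measured wall-clock interval (hypothetical helper)."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return timesteps / elapsed_seconds


# Example values are made up for illustration.
t = timesteps_per_eval(num_envs_per_actor=32, rollout_length=128, num_updates_per_eval=10)
print(t)                         # 40960
print(steps_per_second(t, 8.0))  # 5120.0
```

The point of the sketch is that the per-update count depends only on what a single learner batch contains, not on totals derived from the Anakin setup.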

EdanToledo added 4 commits August 27, 2024 20:37

fix: issues with steps per second metrics in sebulba

db6a23e

chore: refactor to add more metrics for sebulba ppo

ad07426

chore: small change

44a6422

fix: clear pipeline before joining actors - fixes deadlock issue

d30e29a

EdanToledo self-assigned this Aug 27, 2024

EdanToledo merged commit 5284124 into main Aug 27, 2024
3 checks passed

EdanToledo deleted the EdanToledo/issue107 branch August 27, 2024 21:31
